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Abstract 

In this work we address a game theoretic variant of the shortest path problem, 
in which two decision makers (players) move together along the edges of a graph 
from a given starting vertex to a given destination. The two players take turns 
in deciding in each vertex which edge to traverse next. The decider in each 
vertex also has to pay the cost of the chosen edge. We want to determine the 
path where each player minimizes its costs taking into account that also the 
other player acts in a selfish and rational way. Such a solution is a subgame 
perfect equilibrium and can be determined by backward induction in the game 
tree of the associated finite game in extensive form. 

We show that the decision problem associated with such a path is PSPACE- 
complete even for bipartite graphs both for the directed and the undirected 
version. The latter result is a surprising deviation from the complexity status 
of the closely related game Geography. 

On the other hand, we can give polynomial time algorithms for directed acyclic 
graphs and for cactus graphs even in the undirected case. The latter is based 
on a decomposition of the graph into components and their resolution by a 
number of fairly involved dynamic programming arrays. Finally, we give some 
arguments about closing the gap of the complexity status for graphs of bounded 
treewidth. 
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1. Introduction 


We consider the following game on a graph: There is a directed graph G = (V, A) 
given with vertex set V and arc set A with positive costs c(u, v) for each arc 
(u, v) & A and two designated vertices s, t G V. The aim of Shortest Path 
Game is to find a directed path from s to t in the following setting: The game 
is played by two players A and B who have full knowledge of the graph. They 
start in s and always move together along arcs of the graph. In each vertex the 
players take turns to select the next vertex to be visited among all neighboring 
vertices of the current vertex with player A taking the first decision in s. The 
player deciding in the current vertex also has to pay the cost of the chosen arc. 
Each player wants to minimize the total arc costs it has to pay. The game 
continues until the players reach the destination vertex tl3 Later, we will also 
consider the same problem on an undirected graph G = (V,E) with edge set B 
which is quite different in several aspects. 

We will impose two restrictions on the setting in order to ensure that vertex t is 
in fact reachable and to guarantee finiteness of the game. Even in a connected 
graph, it is easy to see that the players may get stuck at some point and reach a 
vertex where the current player has no emanating arc to select. To avoid such a 
deadlock, we restrict the players in every decision to choose an arc (or edge, in 
the undirected case) which still permits a feasible path from the current vertex 
to the destination t (which is computationally easy to check). 

(Rl) No player can select an arc which does not permit a path to 

vertex t. 

In the undirected case, it could be argued that for a connected graph this re¬ 
striction is not needed since the players can always leave dead ends again by 
going back the path they came. Clearly, this would imply cycles of even length. 
Such cycles, however, may cause the game to be infinite as illustrated by the 
following example. 

Example 1. Consider the graph depicted below. Player B has to decide in 
vertex v whether to pay the cost M ^ 2 or enter the cycle of length 3. In the 
latter case, the players move along the cycle and then A has to decide in v with 
the same two options as before for player B. In order to avoid paying M both 
players may choose to enter the cycle whenever it is their turn to decide in v 
leading to an infinite game. 


^ We will always assume that a path from s to t exists. 
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The general setting of the game is representable as a game in extensive form. 
However, there is no concept in game theory to construct an equilibrium for an 
infinite game in extensive form (only bargainin g m odels such as the Rubinstein 
bargaining game are well studied, cf. Osborne 13|, ch. 16]). Thus, we have to 


impose reasonable conditions to guarantee finiteness of the game. 


A straightforward idea would be to restrict the game to simple paths and request 
that each vertex may be visited at most once. While this restriction would lead 
to a simpler setting of the game, it would also rule out very reasonable strategic 
behavior, such as going through a cycle in the graph. Indeed, a cycle of even 
length can not be seen as a reasonable choice for any player since it necessarily 
increases the total costs for both players. However, in the above Example [T] it 
would be perfectly reasonable for B to enter the cycle of odd length and thus 
switch the role of the decider in v. Therefore, a second visit of a vertex may 
well make sense. However, if also A enters the cycle in the next visit of v, two 
rounds through the odd cycle constitute a cycle of even length which we rejected 
before. Based on these arguments we will impose the following restriction which 
permits a rather general setting of the game: 


{R2) The players can not select an arc which implies necessarily a 
cycle of even length. 


Note that (R2) implies that an odd cycle may be part of the solution path, but 
it may not be traversed twice (thus reversing the switching between the players) 
since this would constitute a cycle of even length. This aspect of allowing odd 
cycles will be taken explicitly into account in the presented algorithms. In the 
remainder of the paper we use “cycle” for any closed walk, also if vertices are 
visited multiple times. It also follows that each player can decide in each vertex 
at most once and any arc can be used at most once by each player. 

A different approach to guarantee finiteness of the game would be to impose an 
upper bound on the total cost accrued by each player. This may seem relevant 
especially for answering the decision version where we ask whether the costs for 
both players remain below given bounds (see Section [2]). Clearly, such an upper 
bound permits a clear answer to problems such as Example [TJ In that case, the 
precise value of the bound defines which of the two players has to pay M after a 
possibly large number of rounds through the cycle. Anticipating this outcome, 
the unfortunate player will accept the cost M the first time it has to decide in 
V. However, it should be pointed out that a slight change in the upper bound 
may completely turn around the situation and cause the other player to pay M. 
Thus, the outcome of the game would depend on the remainder of the division of 
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the bound by the cycle cost. This would cause a highly erratic solution structure 
and does not permit a consistent answer to the decision problem. 

As an illustrative example of the game consider the following scenario. Two 
scientists A and B meet at a conference and lay out the plan for a joint paper. 
This requires the completion of a number of tasks. Each task requires a certain 
working time (identical for both A and B) to be completed. Moreover, there 
is a precedence structure on the set of tasks, e.g. the notation and definitions 
have to be finished before the formal statement of results, which in turn should 
precede the writing of the proofs etc. Obviously, this implies a partial ordering 
on the set of tasks. We can represent any state of the paper by a vertex of a 
directed graph with an arc representing a feasible task whose completion leads 
to a new state (=vertex). 

To avoid confusion and a mix-up of file versions the two scientists decide to 
alternate in working on the paper such that each of them finishes a task and 
then passes the file to the other co-author. Since none of them wants to patro¬ 
nise the other they have no fixed plan on the division of tasks, but each can 
pick any outstanding task as long as it is a feasible step to a new state of the 
paper. They continue in this way until the paper is finished, i.e. the final state 
is reached. Since both scientists are extremely busy each of them tries to mini¬ 
mize the time devoted to the paper taking into account the anticipated rational 
optimal decisions by the other. 

More generally, the considered game applies to situations in which a system 
needs to be transformed from status s to status t by traversing several transi¬ 
tional status’ in alternately performed steps. 

1.1. Solution Concept 

In classical game theory Shortest Path Game is a finite game in extensive 
form. All feasible decisions for the players can be represented in a game tree, 
where each node corresponds to the decision of a certain player in a vertex of 
the graph G. 

The standard procedure to determine equilibria in a game tree is backward 
induction (see Osborne [l^ ch. 5]). This means that for each node in the game 
tree, whose child nodes are all leaves, the associated player can reach a decision 
by simply choosing the best of all child nodes w.r.t. their allocated total cost, 
i.e. the cost of the corresponding path in G attributed to the current player. 
Then these leaf nodes can be deleted and the pair of costs of the chosen leaf is 
moved to its parent node. In this way, we can move upwards in the game tree 
towards the root and settle all decisions along the way. 

This backward induction procedure implies a strategy for each player, i.e. a rule 
specifying for each node of the game tree associated with this player which arc 
to select in the corresponding vertex of G: Always choose the arc according to 
the result of backward induction. Such a strategy for both players is a Nash 
equilibrium and also a so-called subgame perfeet equilibrium (a slightly stronger 
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property), since the decisions made in the underlying backward induction pro¬ 
cedure are also optimal for every subtreeH 

The outcome, if both players follow this strategy, is a unique path from s to 
t in G corresponding to the unique subgame perfect equilibrium (SPE) which 
we will call spe-path. A spe-path for Shortest Path Game is the particular 
solution in the game tree with minimal cost for both selfish players under the 
assumption that they have complete and perfect information of the game and 
know that the opponent will also strive for its own selfish optimal value. 

Clearly, such a spe-path path can be computed in exponential time by exploring 
the full game tree. It is the main goal of this paper to study the complexity 
status of finding this spe-path. In particular, we want to establish the hardness 
of computation for general graphs and identify special graph classes where a 
spe-path can be found without exploring the exponential size game tree. 

An illustration is given in Example [2] Note that in general game theory one 
considers only outcomes of strategies and their payoffs, i.e. costs of paths from 
s to t in our scenario. In this paper we will consider in each node of the game 
tree the cost for each player for moving from the corresponding vertex v oi G 
towards t, since the cost of the path from s to n does not influence the decision 
in V. This allows us to solve identical subtrees that appear in multiple places of 
the game tree only once and use the resulting optimal subpath on all positions. 



Example 2. Consider the above graph and the associated game tree. The spe- 
path is determined by backward induction and represented by ordered pairs of 
cost values {x, y) meaning that the decider in a given vertex has to pay a total 
value of X whereas the opponent has to pay a value ofy. 


^In order to guarantee a unique solution of such a game and thus a specific subgame perfect 
Nash equilibrium, we have to define a tie-breaking rule. We will use the “optimistic case”, 
where in case of indifference a player chooses the option with lowest possible cost for the 
other player. If both players have the same cost, the corresponding paths in the graph are 
completely equivalent. Assigning arbitrary but fixed numbers to each vertex in the beginning, 
e.g. 1,..., n, we choose the path to a vertex with lowest vertex number. 
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Note that if the game would be played cooperatively the shortest path of value 9 
would give a lower total cost than the optimal solution of (10,2) in our case. 


In this setting finding the spe-path for the two players is not an optimization 
problem as dealt with in combinatorial optimization but rather the identification 
of two sequences of decisions for the two players fulfilling a certain property in 
the game tree. 

Comparing the outcome of our game, i.e. the total cost of the spe-path, with 
the cost of the shortest path obtained by a cooperative strategy we can consider 
the Price of Anarchy (cf. 0 ) defined by the ratio between these two values. 
However, it is easy to see that the Price of Anarchy can become arbitrarily large, 
e.g. in the following example, where the ratio is . 





1.2. Related literature 

A closely related game is known as Geography (see Schaefer 0). It is played 
on a directed graph with no costs. Starting from a designated vertex s G V, 
the two players move together and take turns in selecting the next vertex. The 
objective of the game is quite different from Shortest Path Game, namely, 
the game ends as soon as the players get stuck in a vertex and the player who has 
no arc left for moving on loses the game. Moreover, there is a further restriction 
that in Geography each arc may be used at most once. 

Schaefer 0 already showed PSPACE-completeness of Geography. Lichten¬ 
stein and Sipser 0 proved that the variant Vertex Geography, where each 
vertex cannot be visited more than once, is PSPACE-complete for planar bi¬ 
partite graphs of bounded degree. This was done as an intermediate step for 
showing that Go is PSPACE-complete. Fraenkel and Simonson gave polyno¬ 
mial time algorithms for Geography and Vertex Geography when played 
on directed acyclic graphs. In Fraenkel et al. 0 it was proved that also the 
undirected variant of Geography is PSPACE-complete. However, if restricted 
to bipartite graphs they provided a polynomial time algorithm by using linear 
algebraic methods on the bipartite adjacency matrix of the underlying graph. 
Note that this result is in contrast to the PSPACE-completeness result of Sec¬ 
tion [3] for Shortest Path Game on bipartite undirected graphs. Bodlaender 
0 showed that Vertex Geography is linear time solvable on both directed 
and undirected graphs of bounded treewidth. For Geography such a result 
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was shown under the additional restriction that the degree of every vertex is 
bounded by a constant - the unrestricted variant however is still open. 

Recently, the spe-path of Shortest Path Game was used in Darmann et al. 
@ as a criterion for sharing the cost of the shortest path (in the classical sense) 
between two players. A different variant of two players taking turns in the 
decision on a combinatorial optimization problem and each of them optimizing 
its own objective function was recently considered for the Subset Sum problem 
by Darmann et al. 

1.3. Our contribution 

We introduce the concept of spe-path resulting from backward induction in a 
game tree with complete and perfect information, where two players pursue the 
optimization of their own objective functions in a purely rational way. Thus, a 
solution concept for the underlying game is determined which incorporates in 
every step all anticipated decisions of future steps. 

The main question we ask in this work concerns the complexity status of com¬ 
puting such a spe-path, if the game consists in the joint exploration of a path 
from a source to a sink. We believe that questions of this type could be an 
interesting topic also for other problems on graphs and beyond. 

We can show in Section [Q that for directed graphs Shortest Path Game is 
PSPACE-complete even for bipartite graphs, while for acyclic directed graphs a 
linear time algorithm follows from topological sort (Section 12.2p . These results 
are in line with results from the literature for the related game Geography. 

On the other hand, for undirected graphs we can show in Section [3] that again 
Shortest Path Game is PSPACE-complete even for bipartite graphs by a 
fairly complicated reduction from Quantified 3-Sat while the related prob¬ 
lem Geography is polynomially solvable on undirected bipartite graphs. This 
surprising difference shows that finding paths with minimal costs can lead to 
dramatically harder problems than paths concerned only with reachability. 

In Section |4] we give a fairly involved algorithm to determine the spe-path on 
undirected cactus graphs in polynomial time. It is based on several dynamic 
programming arrays and a partitioning of the graph into components. The 
running time of the algorithm can be bounded by 0(n^), where n denotes the 
number of vertices in the graph. The easier case of directed cactus graphs follows 
as a consequence. We also argue that an extension of this partitioning technique 
to slightly more general graphs, such as outerplanar graphs, is impossible. 

A preliminary version describing some of the results given in this paper but 
omitting most of the proofs and only sketching the algorithms appeared as Dar¬ 
mann et al. 0- 


7 


2. Spe-paths for Shortest Path Game on directed graphs 

In general, there seems to be no way to avoid the exploration of the exponential 
decision tree. In fact, we can show a strong negative result. Let us first define 
the following decision problem. 

Shortest Path Game: 

Given a weighted graph G with dedicated vertices s, t and two pos¬ 
itive values Ca, Cb^ does the spe-path yield costs c{A) < Ca and 
c{B) < Cb ? 

This means that we ask whether the unique subgame perfect equilibrium of the 
associated extensive form game fulfills the given cost bounds Ca and Cb- 


2.1. PS PACE-completeness 

The PSPACE-completeness of Shortest Path Game on general graphs can be 
shown by constructing an instance of the problem such that the spe-path decides 
the winner of Vertex Geography. In fact, we can give an even stronger result 
with little additional effort. 


Theorem 1. Shortest Path Game is PS PACE -complete even for bipartite 
directed graphs. 


Proof. Inclusion in PS PACE can be shown easily by considering that the height 
of the game tree is bounded by 2\A\. Hence, we can determine the spe-path 
in polynomial space by exploring the game tree in a DFS-way. In every node 
currently under consideration we have to keep a list of decisions (i.e. neighboring 
vertices in the graph) still remaining to be explored and the cost of the currently 
preferred subpath among all the options starting in this node that were already 
explored. By the DFS-processing there are at most 2|A| nodes on the path from 
the root to the current node for which this information has to be kept. 


We provide a simple reduction from Vertex Geography, which is known to be 
PSPACE-complete for planar bipartite directed graphs where the in-degree and 
the out-degree of a vertex is bounded by two and the degree is bounded by three 
(Lichtenstein and Sipser 111). For a given instance of Vertex Geography 
we construct an instance of Shortest Path Game, such that the spe-path 
path decides the winner of Vertex Geography: Given the planar bipartite 
directed graph G = {V,A) of Vertex Geography with starting vertex s, we 
can two-color the vertices of V because G is bipartite. For the two-coloring, we 
use the colors red and green and color the vertices such that s is a green vertex. 


Starting from a copy of G we create a new graph H for Shortest Path Game 
as follows: First we assign a cost 1 to every arc e & A. Let M be a large number, 
e.g. M := |H| -|- 1. Then we introduce a new vertex t which we color red, and 


8 



an arc of weight M from each green vertex to t. Next, introduce a green vertex 
z and an arc of weight M from each red vertex to z. Finally, introduce an arc 
of cost 1 from z to t. 


The constants Ca and Cb are set to Ca = 2 and Cb = M. This means that a 
“yes”-instance corresponds to player A winning Vertex Geography. It is not 
hard to see that H is a bipartite directed graph. Note that since the constructed 
graph is bipartite the rule of Vertex Geography saying that each arc can be 
used at most once is equivalent to (R2) in Shortest Path Game. 

Whenever a player gets stuck in a vertex playing Vertex Geography, it would 
be possible to continue the path in H towards t (possibly via z) by choosing the 
arc of cost M. On the other hand, both players will avoid to use such a costly 
arc as long as possible and only one such arc will be chosen. Thus, the spe- 
path for Shortest Path Game will incur < 2 cost to one player and exactly 
cost M to the other player, who is thereby identified as the loser of Vertex 
Geography. This follows from the fact that if both players follow the spe-path, 
they can anticipate the loser. If it is A, then this player will immediately go 
from s to t. If it is player B, then A will choose an arc with cost 1, then B will 
go to z paying M, and A from z to t at cost 1. □ 

Note that the result of Theorem [1] also follows from Theorem |3] be replacing 
each edge in the undirected graph by two directed arcs. However, we believe 
that the connection to Geography established in the above proof is interesting 
in its own right. 


Remark: 

The above theorem is true even under the following restrictions: the indegree 
of the vertices is bounded by two, the outdegree is bounded by three, and the 
degree of the vertices is bounded by four. However the planarity is lost and 
the method for getting a planar graph described in Lichtenstein and Sipser 
does not carry over to Shortest Path Game in an obvious way. 
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£.2. Directed acyclic graphs 

If the underlying graph G is acyclic we can do better by devising a strongly 
polynomial time dynamic programming algorithm. It is related to a dynamic 
programming scheme for the longest path problem in acyclic directed graphs. 

For any directed acyclic graph we can determine in 0(|A|) time a topological 
sort, i.e. a linear ordering of the vertices, such that all arcs point “from left to 
right” (see e.g. [3, ch. 22.4]). W.l.o.g. we can assume that s and t are the first 
and last vertex in this ordering. 

For each vertex v G V we define the following two dynamic programming arrays: 
Pd{v)' minimal path cost to go from v to t for the player deciding in v. 

Pf{v): minimal path cost to go from v to t for the follower, i.e. the player not 
deciding in v. 
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For each vertex v let S{v) := {u | {v, u) £ A} be the set of successors of v. After 
initializing Pdit) = pfit) =0, the algorithm goes through the remaining vertices 
in reverse order of the topological sort and computes for every vertex v. 

u' := argmin{c(u,u) +pf{u) \ u € S'(u)} 

Pd(v) := c(v,u') +pf(u'), Pf{v) :=pd{u') 

The correctness of the computation follows immediately from the structure of 
the topological sort since all vertices in S{v) are processed before v. Without 
elaborating on a formal proof we state: 

Theorem 2. The spe-path o/S hortest Path Game on acyclic directed graphs 
can he computed in 0(|A|) time. 


In Section|4]we will consider another special graph class and derive a polynomial 
time algorithm for Shortest Path Game on undirected cactus graphs. We 
will argue that this result immediately yields a polynomial time algorithm also 
for directed cactus graphs as stated in Corollary [5l 


3. Spe-Paths for Shortest Path Game on undirected graphs 

We will provide a reduction from the following problem Quantified 3-Sat 
which is known to be PS PACE-complete (Stockmeyer and Meyer (I)- 

Definition (Quantified 3-Sat): 

GIVEN: Set X = {xi,... ,Xn \ of variables and a quantified Boolean 
formula 


F = (3a;i)(Va:2)(3a;3)... (Va;„) (j){xi,. ..,Xn) 

where is a propositional formula over X in 3-CNF (i.e., in conjunc¬ 
tive normal form with exactly three literals per clause). 

QUESTION: Is F true? 

Let Cl,..., Cm denote the clauses that make up (p, i.e., (p^xi ,..., Xn) = C 1 AC 2 A 
... Cm, where each Ci, 1 < i < m, contains exactly three literals. Quantified 
3-Sat can be interpreted as the following game (cf. Eraenkel and Goldschmidt 
Q): There are two players (the existential- and the universal-player) moving 
alternately, starting with the existential-player. The t-th move consists of as¬ 
signing a truth value (“true” or “false”) to variable Xi. After n moves, the 
existential-player wins if and only if the produced assignment makes p true. 
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3.1. PSP/KC^-completeness 


The following result shows a notable difference between Shortest Path Game 
and Geography, since Fraenkel et al. showed that Geography is poly- 
nomially solvable on undirected bipartite graphs (while PS PACE-complete on 
general undirected graphs). 


Theorem 3. Shortest Path Game on undirected graphs is PS PACE -complete, 
even for bipartite graphs. 

Proof. Inclusion in PS PACE follows from a similar argument as in the proof of 
Theorem[TJ Given an instance Q of Quantified 3-Sat we construct an instance 
<S of Shortest Path Game by creating an undirected graph G = (V, E) as 
follows. The vertices are 2-colored (using the colors red and green) to show that 
G is bipartite. To construct G, we introduce (see Figure [T|): 

• green vertices d,p,r, red vertices w,q,t 

• edges {p,q}, {r,t}, {w,d} and {d,t} 

• for each clause Gj, a green vertex cj 

• for each even i, 2 < z < n, an “octagon”, i.e., 

- red vertices Pi,o, Pi, 2 , Pi. 4 , 

- green vertices Pi,i, Pi, 3 , Pj.s, Pi .7 

- edges 0 < £ < 6, and edge 

• for each odd z, 1 < z < n, a “hexagon”, i.e., 

- green vertices Pi,o, Pi, 2 , Pi ,4 

- red vertices Pi,i,uya, pz.s 

- edges 0 < £ < 4, and edge {Pi, 5 ,Ti,o} 


In order to connect these parts, we introduce: 

• for each even z, 2 < z < rz 

- a green vertex Ui 

- edges {z;i_i, 3 ,zzi}, {zZi,z;i,o} and the edges {Ti, 2 ,r}, {Pi,6,T} 

- edge {z;i,4,Pi-i-i,o}, where Vn+i,o ■= P 

- for each clause Gj, the edge {Ti,2,G} ^ {'^*,6,^} if 

X j £ ^j 

• for each odd z, 1 < z < zz 

- the edges {v^p,r}, {Pi, 5 ,T} 
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— for each clause Cj, the edge {vi^i,Cj} if Xi £ Cj and {vi^ 5 ,Cj} if 
Xi £ Cj 

• for each j, 1 < j < m, 

— edges {q,Cj} and {w,Cj} 

Abusing notation, for 1 < j < n, let Xi := {wi, 0 )and Xi := '*^ 1 , 5 } if * 

is odd resp. Xi := if i is even, i.e., we identify a literal with an edge 

of the some label. For illustration, in Figured] we assume Ci = {xi V X2V X3) 
and C 2 = (xi V Sa V Xn)- 

Finally, we define the edge costs0 We start with the edges emanating from 
vertex w. The cost of edge {w,d} is c{w,d} = 0, all other edges emanating 
from w have cost 4. The edge {r, t} has cost c({r, t}) = 0, while each other edge 
emanating from r has cost 2. The edge {vn, 4 ,p} has cost c({vn, 4 ,p}) = 1- For 
each 1 < j < m, the edges emanating from vertex Cj which do not correspond 
to {cj,w} or {cj,q} have cost 3. The remaining edges have zero cost. 

Note that from the fact that each edge connects a green with a red vertex, it 
immediately follows that G is a bipartite graph. Now, in S we set Ca = 0 and 
Cb = 2, and ask if the spe-path yields costs c{A) < Ca and c{B) < Cb- 

Claim. Q is a “yes”-instance of Quantified 3-Sat 5 is a “yes”-instance 
of Shortest Path Game. 

Proof of claim. Player A starts with the first move along an edge emanating 
from s, i.e., A moves along xi or xi. Due to the fact that s is a green vertex 
and G is bipartite, we can conclude that in a green vertex, A needs to choose 
the next edge, and in a red vertex, it is i?’s turn to choose the next edge. 
W.l.o.g. assume A moves along xi (which, as we will see later, corresponds to 
setting to true Xi in instance Q). Because of (R2), player B cannot move back 
to s along xi- Now, B has three choices. B either moves along (i) edge {ui^, r} 
with cost 2, or (ii) edge {ui^i,ui_ 2 } with zero cost, or (iii) an edge of cost 3 
connecting with some vertex cj. 

Case I: B moves along {uip,r}. 

Then A will use the edge (r, t) of zero cost in order to minimize her own total 
cost and the game ends with c{A) = 0 and c{B) = 2. 

As a consequence of the outcome in Case I, B will not choose (iii), because B 
would end with c{B) > 3 in that case. Note that each vertex Vi^ for odd i (resp. 
uq 2 for even i) is a red vertex, i.e., an analogous situation applies in each such 
vertex. Thus, we can observe the following: 

(Observation 1) Player B does not move along an edge of cost 3, 
i.e., an edge {ui_i,Cj} for odd i resp. {ui^ 2 ,Cj} for even i. 


®In the introduction edge costs were defined to be strictly positive. For simplicity we use 
zero costs in this proof, but these could be easily replaced by some small e > 0. 
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Figure 1: Undirected graph G of instance <S constructed for an instance Q of Quantified 
3-Sat with Cl = (xi V 5:2 V X 3 ) and C 2 = (xi V £3 V x„). 
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Case II: B moves along {wi,i,vi, 2 }- 

In vi^ 2 , player A needs to move along {wi, 2 ,^'i, 3 } (again, A cannot move back 
to r'i _2 along {t'l,!, 1 ^ 1 , 2 } due to (R2)). At vi_ 3 , it is again player B’s turn. 

Case Ila: B moves along {wi, 3 , 1 ^ 1 , 4 }. 

Then A necessarily moves along {ni, 4 ,wi, 5 }, leaving B with the decision in wi, 5 . 
B must not move along xi because of (R2). Because of Observation 1, this 
means that B moves along {ui, 5 ,r} of cost 2. Clearly, A then chooses edge 
(r,t), and again the game ends with c{A) = 0 and c(B) = 2. 

Case Ilb: B moves along {ui, 3 ,'U 2 }. 

In the next step, A moves to U 2 ,o- Now, it is R’s turn to pick X 2 or a; 2 , he., to 
decide which literal she sets to true in instance Q. W.l.o.g. assume B moves 
along X 2 - In the next step A has only one edge to move along, leaving B with 
the decision in V 2 fi- Analogously to above, the game ends with c{A) = 0 and 
c{B) = 2 if R moves along {v 2 fi,r}. If B does not move along {^ 2 , 6 )?'}) the 
players continue until ^ 2,4 is reached, because B does not choose an edge of cost 
3 (see Observation I). Again, it is R’s turn: 

If B moves along {^ 2 , 4 , ^ 2 , 3 }, A necessarily moves along {^ 2 , 3 , r' 2 , 2 }- In W 2 , 2 , B is 
not allowed to move along {^ 2 , 2 , ^' 2 , 1 } because this would result in a deadlock {A 
would be unable to move along another edge due to (RI))- With Observation 1, 
this means that B moves along {w 2 , 2 ) r}; i.e., again the game ends with c(A) = 0 
and c{B) = 2. 

Assume that B moves along {u 2 , 4 ,f 3 ,o}) leaving A to choose in vs^o between X3 
and X3. It is easy to see that in what follows, repeatedly analogous decisions 
need to be made. As a consequence, we can observe the following. 

(Observation 2) If Shortest Path Game ends before vertex Vn ,4 
is reached, it ends with c(A) = 0 and c{B) = 2. 

(Observation 3) If vertex is visited, then for even i player B 
has chosen between Xi and Xi, while for odd i player B has chosen 
between Xi and Xi. 

Assume that vertex is visited, where it is R’s turn to take the next decision. 
Analogously to above we know that, if B does not move along {v„q,p}, then 
again the game ends with c{A) = 0 and c{B) = 2. Assume B moves along 
{vn, 4 ,p} with cost 1. Clearly, in the next step A has to move along {p,q} 
leaving B with the decision in vertex q, i.e., B has to choose a vertex Cj (i.e, a 
clause Cj) to move to. 

Let (j) be the truth assignment that sets to true exactly the literals (edges) chosen 
by a player so far. We will argue that B aims at picking a clause Cj which is 
not satisfied by (j). We will illustrate this by assuming that B decides to move 
to vertex ci representing clause Ci = (xi V X 2 V 0 : 3 ). 

If Cl is not satisfied by </>, i.e., if none of the literals xi,X 2 tX 3 has been moved 
along, then all edges a;i,a; 2 ,X 3 have been used already. As a result, A is not 
allowed to move along the edges { 01 ,^ 14 }, {ci,V 2 , 2 } and {ci,U 3 ^ 5 } because of 
(R2). Again due to (R2), A must not move back to q along edge {ci,q}. Thus, 
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A needs to move along {ci,w} imposing a cost of 4 on player A. In the next 
steps, B obviously moves along edge {w, d}, implying that A moves along {d, t}. 
Thus, the game ends with c{A) = 4 and c{B) = 1. 

On the other hand, if at least one of the literals Xi,X 2 ,X 3 have been used so 
far, then at least one of the edges xi,X 2 ,X 3 have not already been used. As 
a consequence, A - trying to avoid the expensive edge {ci, ui} - is able to use 
one of the edges {ci,uip}, {ci,U 2 , 2 } and {ci,U 3 _ 5 }. W.l.o.g. assume that X 3 has 
not been used and that A now moves along {ci,U 3 _ 5 } with cost 3. At W 3 _ 5 , B 
cannot move along X 3 due to (R2). If B moves along {u 3 _ 5 ,W 34 }, a deadlock 
arises: A is neither able to move back to ^ 3^5 along {w 3 , 5 ,U 34 }, nor to move 
along {u 3 , 4 ,W 3 , 3 }, because this would violate (R2) since both V 3^3 and ^ 3^5 have 
already been visited. Due to (RI), player B hence must not use {u 3 _ 5 ,U 3 , 4 }. 
Consequently, player B possibly needs to choose between an edge of cost 2 or 
edge {u 3 ^ 5 ,r} with cost 2. Clearly, B chooses the latter edge, since in vertex 
r player A will move along {r,t\. Thus, the game ends with c{A) = 3 and 
c{B) = 1 + 2 = 3. 

Summing up the two cases we have: 

If Q is a “yes”-instance then it is better for B to move along an edge towards r 
before reaching p, resulting in an outcome with c(A) = 0 and c{B) = 2. 

If Q is a “no”-instance there is a clause which is not satisfied by (p and B will 
move along resulting in c{A) = 4 and c{B) = 1. □ 

Remark. Using the reduction provided in the above proof, it is not hard 
to show that Shortest Path Game remains PS PACE-complete in bipartite 
undirected graphs, even if (R2) is relaxed in the way that we only require an 
edge not to be used more than once, but do not rule out cycles of any length. 

3.2. Undirected acyclic graphs (trees) 

Complementing Section [2^ we observe that in undirected acyclic graphs, i.e., 
trees. Shortest Path Game becomes trivial: Since in a tree there is exactly 
one path between two dedicated vertices, the two player have no choice but to 
follow the unique path from s to t. Taking a diversion to a side branch, the 
players would always have to go back the same way they came implying a cycle 
of even length. 


4. Shortest Path Game on undirected cactus graphs 

Since Shortest Path Game is trivial on undirected trees and rather easy to 
solve in polynomial time on directed acyclic graphs we try to push the bar a bit 
higher and find polynomial algorithms on more general graph classes. In this 
section we will show that a cactus graph still allows a polynomial solution of 
Shortest Path Game. This is mainly due to a decomposition structure which 
allows to solve components of the graph to optimality independently from the 
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solution in the remaining graph. However, we will illustrate by an example in 
Section 14.41 that more general graphs, such as outerplanar graphs, which are a 
superset of cactus graphs, do not allow such a decomposition of the solution 
structure. This gives a certain indication that Shortest Path Game might 
become computationally intractable on slightly more general graphs. 



Figure 2: Graph G with connection strip G' (in black) and branches G\G' (in red). 

A cactus graph (also known as Husimi tree) is a graph where each edge is con¬ 
tained in at most one simple cycle. Equivalently, any two simple cycles have 
at most one vertex in common. This means that one could contract each cycle 
into a vertex in a unique way and obtain a tree. Note that cactus graphs are a 
subclass of series-parallel graphs and thus have treewidth at most 2. 

Considering Shortest Path Game, it is easy to see that the union of all 
simple paths from s to t define a subgraph G' consisting of a unique sequence 
of edges (which are bridges of the graph) and simple cycles. We will call G' the 
connection strip between s and t. All other vertices of the graph are “dead end 
streets”, i.e. edges and cycles branching off from G' (see Figure [5]). In the spe- 
path vertices in G \ G' could be included only to change the role of the decision 
maker in a vertex of G'. Clearly, any such deviation from G' must be a cycle 
rooted in some vertex of G'. Moreover, by (R2) only cycles (not necessarily 
simple) of odd length might be traversed in this way. This structural property 
gives rise to a preprocessing step where all vertices in G \ G' are contracted 
into a swap option in a vertex v G G' (see Figure[31) with cost {swd{v), swf{v)) 
meaning that if the path of the two players reaches a certain vertex v € G', 
the current decider has the option to switch roles (by entering an odd cycle in 
G \ G' rooted in v) at cost of swd{v) for himself (the decider) and swf{v) for 
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Figure 3: Graph G with connection strip G'\ branches G\G' are contracted into swaps, 
the other player (the follower). 

If there is more more than one cycle rooted in some vertex w, each of them is 
contracted separately. But since the spe-path can utilize at most one swap in v, 
we simply pick the swap option with the smallest cost for the decider. 

Our algorithm will first compute these swap costs by recursively traversing the 
components of G \ G' in Section H?T] Then, in the second step, the spe-path in 
G' is computed by moving backwards from t towards s in Section 14.21 In each 
iteration of this second step a cycle is considered, where the part of the spe- 
path from the “exit” of the cycle towards t is already known (resp. only a small 
number of possibilities remain). By evaluating several dynamic programming 
arrays, the best options for moving from the “entry” of this cycle to the exit are 
computed. These iterations are continued until the starting vertex s is reached. 


^.1. Contraction of the branches 

Consider a cycle C{vo) which is connected to the remaining graph only via vq and 
all other vertices of G(r!o) have degree 2, i.e. all other edges and cycles incident 
to these vertices were contracted into swap options before. For simplicity of 
notation we refer to vertices by their index number and assume that C(vo) = 
G(0) consists of a sequence of vertices 0,1, 2,..., /c — I, fc, 0. 

There are four possibilities how to use the cycle for a swap: The players could 
enter the cycle by the edge (0,1) and go around the full length of the cycle 
(possibly using additional swaps in vertices of the cycle) as depicted in FigurelH 
Or after edge (0,1) the players could move up to some vertex £ € {1,2,, kj, 
turn around by utilizing a swap option in £ and go back to 0 the way they came 
as depicted in FigureO Note that in the latter case, the players can not use any 
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Figure 4: Cycle with a possible path (in red) going round the cycle (with possible swaps 

on the way). 


additional swaps in vertices 1, — 1 (resp. k,k — since in that 

case the swap vertex would be visited three times in violation of (R2). Thus, we 
have to distinguish in each vertex whether such a turn around is still possible 
or whether it is ruled out by a swap in a previously visited vertex of the cycle. 

These two conhgurations can also be used in a laterally reversed way moving 
on the cycle in the different direction starting with the edge ( 0 , k) which yields 
four cases in total. 

Let D S {A^ B} be the decision maker in 0. We introduce the following generic 
notation for dynamic programming arrays: 

denotes the cost of a certain path starting in vertex i and 
ending in a fixed specified vertex. 

We use the subscript P S {d, /}, where P = d signifies that the cost occurs for 
the player deciding in i and P = f refers to the cost of the follower, i.e. the other 
player not deciding in i. Superscript ± S {+, —} shows that the decider in i is 
equal to Z? if ± = +, or whether the other player decides in i, i.e. ± = —. For 
simplicity, we also extend the cost range and use cost T if a path is infeasible. 
When taking the minimum of values, T stands for an arbitrarily large value. 
Adding the cost of a path to an array entry T yields again T. 

Following this system we define: 

tep(i) : minimal cost to move from i back to 0 , if a turn around is still possible. 
rcp{i) : minimal cost to move from i back to 0 , if no turn around is possible 
and the path has to go around the cycle, i.e. visit vertices i + l,i + 2,... ,k,0, 
with possible swaps on the way. If one player decides to turn around at some 
vertex i, the cost of the path back towards vertex 0 is completely determined 


18 








Figure 5: Cycle C{vq) with a possible path (in red) using a swap in vertex i. 


since no choices remain open. The corresponding costs are independent from D 
and will be recorded as patlip(j) in analogy to above. 

Now we can state the appropriate update recursion for the case where D chooses 
(0,1) as a first edge. We go backwards along this path and settle the minimal 
costs for vertices k,k—l,... ,1. The case where D moves into the other direction 
of the cycle is completely analogous and the final swap costs sw{vo) are given 
by the cheaper alternative. 


player A 



A moves to i + 1 swap in i 
Case 1. player B 



B moves to i + 1 B turns and 
Case 2a. returns to 0 

Case 2b. 

Figure 6: Decision tree for player A deciding in vertex i. 


In any vertex i the decision process has at most three ontcomes as illustrated 
in Figure ini 

1 . move on along the cycle to vertex i + 1: 
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rcj (i) := c{i, i + 1) + rcj (i + 1), rc^ (i) = rcj (i + 1) 
rc2 [i) := c(i, i + 1) + rc^ (i + 1), rcj (i) = rc^ (i + 1) 
tc^(i) := c(i, i + 1) + tcj (i + 1), tcj (i) = tcj {i + 1) 
tcj (i) := c(i, i + 1 ) + tc'j (i + 1 ), tcj {i) = tc'^(i + 1 ) 

2. make a swap (if available): Then the other player has (at most) two 
possibilities and chooses the one with the lower cost between 2 a. and 2 b. 
(or the only feasible choice), which automatically implies the cost for the 
decider in i. 

2 a. move on to vertex i + 1 : 

rc^(i) = swf{i) + c{i, i + 1 ) + rc^(* + 1 ), rc'^(i) := swd(i) + rc^(i + 1 ) 

rcj (i) = swf(i) + c(i, i + 1 ) + rcj (i + 1), rcj(i) := swd(i) + rcj(i + 1 ) 

(i) := swf{i) + c{i, i + 1 ) + rcj{i + 1 ), tcJ{i) := swd{i) + rcj{i + 1 ) 

tcj(i) := swf{i) + c{i, i + 1 ) + rcj {i + 1 ), tcj{i) := swd{i) + rcj{i + 1 ). 

2b. turn around (if possible): Since the decider at the end of the return path in 
vertex 0 must be different from the feasibility of a turn around depends 
on the number of edges between 0 and i. 

If i is even: 

tcj{i) := swd{i) +pathj(i), tc'j{i) := swf{i) +path^(i) 
tcj {i) ■■= T, tcj {i) := T 
If i is odd: 

tcj{i) := T, tcj{i) := T 

tcj{i) := swd(i) +path^(i), tcj{i) := swf{i) +path^(i) 

Now the decider in vertex i can anticipate the potential decision of the other 
player in case 2 ., since the other player will choose the better outcome between 
cases 2a. and 2b. (if 2b. is feasible). Hence, the decision maker in vertex i chooses 
the minimum between case I. and case 2. (if a swap is possible) independently 
for all four dynamic programming entries. This immediately implies the cost 
for the other player. 

It remains to discuss the initialization of the arrays for i = k, i.e. the last vertex 
in the cycle and thus the first vertex considered in the recursion. To avoid the 
repetition of all three cases implied by a possible swap at vertex k, we add two 
artificial vertices k + 1 and k + 2 and three artificial edges {k,k + l ), (fc + 1, fc + 2 ) 
and {k + 2,0) replacing the previous edge (fc, 0). We set c{k, k + 1) = c{k, 0) and 
the other two artificial edges have cost 0. It is easy to see that this extension of 
the cycle does not change anything. Now we can start the recursive computation 
at vertex k + 2 which has no swap option. (If vertex k does not have a swap 
option we can skip this extension and perform the following initialization for k 
replacing fc + 2.) We get: 

rcj{k + 2) := c{k + 2 , 0 ), rcj (fc + 2 ) = 0 
rcj (fc + 2) := T, rcj {k + 2) = + 
tcj{k + 2) := c{k + 2 , 0 ), tcj{k + 2) = 0 
tcj{k + 2) := T, tcj{k + 2) = T 
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4-2. Main part of the algorithm 


Now we proceed to the main part of the algorithm to determine the spe-path for 
moving from s to t along the connection strip G' after the reminder of the graph 
was contracted into swap options in vertices of G'. We traverse the connection 
strip backwards starting in t and moving upwards in direction of s. Recall that 
the connection strip consists of a sequence of cycles and edges. In the following 
we will focus on the computation of an optimal subpath for one cycle of this 
sequence. Each such cycle has two designated vertices which all paths from s to 
t have to traverse, an “upper vertex” vq through which every path starting in s 
enters the cycle, and a “lower vertex” ve through which all paths connecting the 
cycle with t have to leave the cycle. As before we refer to the vertices of the cycle 
simply by their index numbers and denote the cycle as 0 , 1 ,..., £, £ + 1 ,..., fc, 0 . 



Figure 7: Case (i): Forward swap in x. 


If we assume that the decision maker decides in 0 to take the edge (0,1) there 
are two main possibilities for the path from 0 to £ and onwards to the next cycle 
or edge. The other situation, where the decision maker starts with the edge 
(0, k) is completely symmetric and the decider will finally take the better of the 
two options. 

Case (i): The two players may move along the vertices 0,1,..., £, possibly with a 
few swaps on the way. After reaching £, they may either exit the cycle or continue 
to £ + 1 ,..., x, make a forward swap in x and return back via x — l,x — 2,... 
back to £ and finally exit the cycle (see Figure [7]). As a special variant of this 
situation, the players may also never swap in some vertex x but go back to 0 
thus traversing the full cycle and then taking the path 0 , 1 ,... ,£ a second time 
as depicted in Figure HI 

Case (ii): As a second, more complicated possibility, the two players may also 
move along vertices 0 , 1 ,..., j, j < £, and then utilize a swap in j and return to 
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Figure 8: Case (i), special case: Traverse the full cycle. 


0. Then they are forced to move on from 0 to A:, fc —1,..., After reaching I they 
may either exit the cycle directly or they may also continue to i — 1,1 — 2,... ,y 
with y > j, make a backward swap in y and return via y + 1 ,..., £ where they 
finally exit the cycle (see Figure [9]). 

Of course, all the resulting subpaths have to be feasible, in the sense that they 
actually lead to the desired swap between decider and follower and contain no 
even cycles. Moreover, all subpaths that are dead end streets used only for 
reaching a swap at some vertex are traversed twice and thus are not allowed to 
contain any swap except at their endpoints. In the above notation this holds 
for the subpaths I + 1,... ,x, the full cycle, 0 ,...,y and £,...,y. 



Figure 9: Case (ii): Backward swap in y. 
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From our backward processing through the connection strip we can assume that 
we already know the spe-path from Vi to t (without using any other vertex from 
the current cycle). If there is a bridge leading from vg towards t, it is easy to 
take the costs of the spe-path from Vi via this edge onwards to t into account. 
However, the situation gets more complicated if two cycles meet directly in one 
vertex, i.e. if the lower vertex vi of the current vertex is also the upper vertex of 
the following cycle in the connection strip towards t. In such a case we cannot 
combine the solutions of the two adjacent cycles in an arbitrary way since each 
player can decide in vertex Vi at most once. 

For the correct combination of solutions of two adjacent cycles it will be neces¬ 
sary to distinguish whether the path through the current cycle takes only one 
decision in Vi and then leaves the cycle (Case (i) without forward swap in x 
or Case (ii) without backward swap in y) or whether the hrst decision in vi 
leads to another vertex in the cycle and thus a second decision takes place in 
vi with the only possible result of leaving the cycle. In the former case, we do 
not have any restriction on the solution of the following cycle in the connection 
strip (closer to t) since the path is also allowed to move back to vi (which is the 
upper vertex of that cycle) as it is done in Case (ii). We denote by ed2vp (exit 
costs) the costs of the spe-path from vi to t allowing to visit Vi a second time. 
In the latter case, we only consider paths that never move back to ve in the 
following cycle (i.e. only Case (i) applies) and denote the cost of the spe-path 
from Vi to t under this restriction by edp. 

We now process the current cycle to determine the spe-path starting in vg- 
Keeping the notational system introduced above we will introduce the following 
dynamic programming arrays: 

td^{i) : minimal cost to move from i, i = 1, — 1, to £ and exit the cycle, if 

a turn around (according to Case (i)) is still possible. 

rdi^ii) : minimal cost to move from i, i = 1, ...,£ — 1, to £ and exit the cycle, 
if no turn around is possible. 

ad^{i) : (“alternative path”) minimal cost for moving from i, £ < i < k, via 
i — l,i — 2,... to £ (Case (ii)). We will later add a parameter j to the definition 
of this array since it may be combined with a backward swap (see below). 
/sp(z) : (“forward swap”) minimal cost for moving from i, £ < i < k, without 
swaps to some vertex x, i < x < k, make a swap in x and return to £ (Case (i)). 
The parity ± = -f indicates that the decider in i is the same as the decider in £. 
bs^(i, j) '■ {“backward swap”) minimal cost for moving from i, j < i < £, without 
swaps to some vertex y, j < y < i, make a swap in y and return to £ (Case 
(ii)). Note that parameter j indicates a lower bound on the position of the swap 
vertex y. Since we do not know at this point at which vertex j the path starting 
in 0 was stopped by a swap, we have to compute the values of this array for all 
values of j, j = 1, ...,£— 1. The parity ± = -I- indicates that the decider in i is 
the same as the decider in £. 

We hrst show how to compute the entries of the auxiliary dynamic programming 
arrays starting with the forward swap moving backwards from a starting point 
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i = k' — 1 (see below) down io i = ^. Let fpathp(i) denote the cost of the path 
from i back towards £. We have to take the minimum between the following 
two cases (if the second case exists): 

1 . move on along the cycle to vertex i + 1 : 

/s+ (i) := c{i, i + 1 ) + fs~j {i + 1 ), /s^(i) := fs~ (i + 1 ) 
fSd (0 := c(h * + 1 ) + fSf (i + 1 ), fsj (i) := fsj(i + 1 ) 

2. make a swap in i (if available and if i > £): Then the other player has 
to return to I to avoid a violation of (R2). The feasibility of such a turn 
depends on the number of edges between £ and i. 

If i ^ is even: 

(*) := swdii) + fpath^(z), /s^(z) := swf(i) + fpath^(z) 

/Sd(z) :=T, /s7(z) :=T 
If z — ^ is odd: 

/4(z) := T, /4(z) := T 

/s 7 (z) := swd{i) + fpathy(z), /s^(z) := szc/(z) + fpath^(z) 

As an initialization we start at a vertex with a swap option having the largest 
index k' <k and insert the values of the above Case 2. for z = fc'. 

Considering the backward swap we let bpathp(z) denote the cost of the path 
from z towards £. Now we have to perform the following computation for all 
values of j = 1, I. Again we have to take the minimum between the 

following two cases (if the second case exists) while we move up from z = j' + 1 
(defined below) to z = f. 

1 . move on along the cycle to vertex z — 1 : 

bSdihj) ■= c{i - l,z) + bsj{i - l,j), bs^{i,j) := bs';^{i - I, j) 
bSdihj) ■■= c{i - l,i) + bs^{i - l,j), bsj{i,j) := bs'^{i-l,j) 

2. make a swap in z (if available and if z < £): Then the other player has 
to return to £ to avoid a violation of (R2). The feasibility of such a turn 
depends on the number of edges between i and £. 

If £ — z is even: 

bSdihj) ■= swd{i) +bpath^(z), bs~j{i,j) := swf{i) + bpath^(z) 
bSdiiJ) ■■= T, bsj{i,j) := T 
If £ — z is odd: 

bsj{i,j) := T, bsj{i,j) := T 

bs^iij) := swdii) +hpathf{i), bsj{i,j) := s'u;/(z) + bpath^(z) 

As an initialization we start at a vertex with a swap having the smallest index 
j' > j and insert the values of the above Case 2. for z = f. 

Next we determine the path from 0 via k,k — 1,... io £, when no turn arounds 
are possible any more (Case (ii)). We consider the decision at vertex z and 
compute the entries of the dynamic programming array from z = £ + 1 on until 
i = k. In each vertex, the decision maker has to take the minimum between the 
following two cases (if the second case exists): 
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1 . move on along the cycle to vertex z — 1 : 

ad'^{i) := c{i — l,z) + adj {i — 1 ), ad'j{i) := ad^{i — 1) 

ad'^{i) := c(z — 1 ,z) + ad '^(z — 1 ), adj{i) := ad '^(z — 1 ) 

2. make a swap in i (if available): Then the other player has to continue 

moving along the edge (z — 1 , z) towards £ to avoid a violation of (R 2 ). 
ad'^ (z) := swd{i) + ad ^(z — 1 ), (z) := swf{i) + c{i — 1 , z) + ad '^(z — 1 ) 

ad'^{i) := swd{i) + o,d'^{i — 1 ), adj{i) := swf{i) + c{i — 1 ,z) + adj(i — 1 ) 

Finally, we extend the definition to z = 0 and compute ad^{0) according to case 
1 . with k replacing z — 1 . 

Note that all entries of adp are extended to include a parameter j in analogy to 
bsp which has no influence on the definition of the above recursion, although the 
array entries for different values of j may differ due to the different initialization. 

As an initialization we have to consider the decision in £ for Case (ii). The 
decider can either move on directly to the next cycle resp. edge in the connection 
strip or make a swap, either by a swap option available in £ from the previous 
contraction of G \ G' or by entering the backward swap. Because of (R2) at 
most one of the two swap variants can be part of the solution. Thus we have: 

ad'^{£,j) := miii{ed2vd,swd + edj, bs'^{£,j) + ed]:}, ( 1 ) 

0‘d'2{£,j) ■= min{ed 2 z;d, szud + ed}, j) + ed}}. 

The values for the follower are immediately implied by the selection of the 
minimum. 

Now we can turn to the main arrays and determine their values for all values 
of z by moving backwards from i = £ —1 down to z = 1. Let D € {A,B} 

again be the decision maker in 0. We let rpathp(z) denote the cost of the return 

path from z towards 0. In any vertex z the decision process has at most three 
outcomes, from which the cheapest is chosen. (Note that cases 1. and 2. are 
completely analogous to the earlier problem of contracting a cycle.) 

1 . move on along the cycle to vertex z + 1 : 

rd'^ (z) := c(z,z + 1 ) + rdj{i + 1 ), rd^ (z) = rd'2{i + 1 ) 
rd'2{k) := c(z,z + 1 ) + rd^ (z + 1 ), rdJ (i) = rd'^{i + 1 ) 
td)J'(z) := c(z,z + 1 ) + tdj (z + 1 ), tdj (z) = td'^{i + 1 ) 
td'^{k) := c(z, z + 1 ) + td '^(z + 1 ), tdj{i) = td '^(z + 1 ) 

2. make a swap (if available): Then the other player has (at most) two 
possibilities and chooses the one with the lower cost between 2 a. and 2 b. 
(or the only feasible choice), which automatically implies the cost for the 
decider in z. 

2 a. move on to vertex z + 1 : 

rdy (z) = swf{i) + c{i,i + 1 ) + rd^(z + 1 ), rd'^ (z) := swd{i) + rd'^ii + 1 ) 

rd'j(i) = swf(i) + c(i, z + 1 ) + rdJ (z + 1 ), rd'^{i) := swd{i) + rd'^{i + 1 ) 

td'^(i) := swf{i) + c(z,z + 1 ) + rd'^ (* + 1 ), td^ (*) := swd{i) + rd^(i + 1 ) 

td'j (z) := swf{i) + c(z, z + 1 ) + rdj (z + 1 ), tdj{i) := swd{i) + rdj{i + 1 ). 
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2b. turn around (if possible): Then the players have to go directly back to 0 
at cost rpathp(j) and then move to I at cost adp{0,i) (Case (ii)). Recall 
that the second parameter of ad, i in our case, indicates a lower bound on 
the end vertex of a possible backward swap. Since the decider in vertex 0 
must be different from D, the feasibility of a turn around in i depends on 
the number of edges between 0 and i. This yields: 

If i is even: 

td^ (i) := swd{i) + rpathjr(i) + ad'j (0, i), 
td'^{i) := swf{i) + rpath^(z) + ad'2 (0, i), 
td'^{i) := T, tdj {i) := T 
If i is odd: 

tdj{i) := T, td^{i) := T 

td^ (i) := swd{i) + rpathy (i) + ad'^ (0, i), 

tdJ (i) := swf{i) + rpath^(i) + adj (0, i) 

For the initialization we have to consider the decision in £ for Case (i). Note 
that for Case (ii) a decision was made in some vertex j to turn around. For this 
decision the cost of the full path from j back to 0 and then to the next cycle 
and onwards to t had to be taken into account. These costs were included in 
the initialization of adp in (HD. For Case (i) we proceed in a similar way: The 
decider can either move on directly to the next cycle resp. edge in the connection 
strip or make a swap, either by a swap option available in £ from the previous 
contraction of G \ G' or by entering the forward swap. Because of (R2) at most 
one of the two swap variants can be part of the solution. After a swap the other 
player has to move on to the next cycle resp. edge. Thus we have: 
rd'^{£) := mm{ed2vd, swd + edj, fs'^{£) + ed^}, 
rd^ {£) := min{ed2ud, swd + eddj, fs^{£) + ed^} 

The values for the follower are immediately implied by the selection of the 
minimum. 

In the case that a turn around is still possible, i.e. no swap is performed in 
any vertex between 0 and £, there is also the special case of reaching a swap 
by traversing the full cycle and repeating the path from 0 to £. Therefore we 

denote by (full cycle) the minimal cost to move from i, i = £,..., k, to 0 

with possible swaps on the way and then via 1,2,... to £ without swaps. The 
parity ± is aligned with the decider in £. The computation of /cp is very similar 
to /sp, but with the addition of the fixed path 0,1,..., £ instead of the return 
path. Thus we refrain from giving the details. The initialization for i = k has 
to observe the parity condition to guarantee that the going round the full cycle 
actually changes the role of the decider in £. We get: 
td^{£) := min{ed2nd, swd + edj, fs'^{£) + edj, fc^ + edj}, 
tdJ{£) ■■= m.\n{ed2vd, sWd + edj, /sj "{£) + ed}, /cj" + ed}} 

At the end of these computations we can determine the exit costs for the cycle 
or edge preceding the current cycle, i.e. the costs of the spe-path from vq to t. 
It is easy to see how these exits costs can be propagated along an edge leading 
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from vertex vq of the current cycle towards s. The general case of the spe-path 
for the current cycle yields ed^ := td'^{0) and edj- := td^(O). If the path for 
the current cycle is restricted and thus leaves more freedom of choice for the 
preceding cycle, we have to compute ed2vp by repeating all computations for the 
current cycle with the restriction that vq can be visited only once. This means 
that Case (ii) and the special case of Case (i), which traverses the full cycle, 
are simply removed from consideration and the reduced problem is computed 
as before. We refrain from giving the simple details of this process. 

Recall that we also have to consider the mirrored situation where D picks (0, k) 
as the first edge in the cycle and run all the computations once more with exactly 
opposite configurations. At the end, we determine each of ed\ and ed2vd as the 
minimum among among the two directions with the obvious consequences for 
the follower. 


For a directed cactus graph, i.e. a directed graph which can be obtained from 
an undirected cactus graph by assigning a direction to each edge, the number 
of possibilities for the two players to explore a cycle is much more restricted. 
A cycle may be directed in a cyclic order and thus allow a swap option (in the 
contraction of branches) or a full cycle in the main part of the algorithm (Case 
(i), special case), both in a unique way. Or a cycle permits two paths from vg to 
Vi in the main part of the algorithm, which can be settled by simply exploring 
both options for the decider in vg- In this case, no swap is possible and such 
a cycle can be eliminated if it is in G \G'. Finally, a cycle may leave only 
a unique way for its traversal thus being equivalent to a sequence of edges or 
contain directions that leave it blocked. In all cases, we can find the spe-path 
in polynomial time by a considerably simplified version of the above algorithm. 


4-3. Running time 

Let n := \V\. The overall execution of the algorithm is based on the tree 
structure inherent in every cactus graph by contracting its cycles. The algorithm 
first considers all subtrees of this tree lying outside the unique path from s to 
t and contracts recursively all leaves of these subtrees to their parent vertex. 
Then the main path is resolved moving from t to s. All together, each cycle 
is contracted into a constant number (2 or 4) of cost values (swap option or 
exit costs) containing all the information of previously considered parts of the 
graph. Thus, the overall running time is given by the sum of running times for 
the contraction of all cycles in G' and G\G'. 

Considering the computation for one cycle of the connection strip G', we can 
observe that each entry of the dynamic programming arrays can be computed 
in constant time. Entries of the arrays are computed once by moving along 
certain parts of the cycle which yields on overall linear running time (linear in 
the number of vertices of the cycle). This also applies to the arrays used in the 
branch contraction of G \ G' performed in Section 14.11 There are two notable 
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exceptions from this property, namely the backward swap bs^{i,j) and the alter¬ 
native path ad^{i,j) which have a quadratic number of entries to be computed, 
each of them in constant time. Thus, we have the following statement. 

Theorem 4. The spe-path of Shortest Path Game on undirected cactus 
graphs can be computed in 0(pf) time. 

Since backward swaps do not apply for directed graphs we can state immediately. 


Corollary 5. The spe-path o/S hortest Path Game on directed cactus graphs 
can be computed in 0(n) time. 

4 . 4 . More general classes of graphs and some observations 

The main reason why Shortest Path Game is solvable in polynomial time 
on cactus graphs, which have treewidth 2, is the fact that optimal solutions of 
subgraphs can be used for deriving an optimal solution of the whole graph. In 
this section we give a simple example showing that this does not work anymore 
for outerplaner graphs, which still have treewidth 2 and are a superclass of 
cactus graphs: In Figure [TU] the path corresponding to the optimal strategy is 
illustrated by giving edges chosen by player 1 the color red and its opponent 
blue. The spe-path has length (2,1) and goes through vertices a and b. However, 
when restricting the problem to the subgraph G \ {s,t} (framed by the dashed 
line in the third picture of Figure llOl) a spe-path from a to 6 with player 2 starting 
in a consists of simply choosing edge (a, b) with length (0,0). If however player 
2 would choose this edge when playing the game on the whole graph, player 1 
would choose the edge of length 1 forcing player 2 to use the edge of length M 
in order to reach t. Thus, the connection from a to b traversed in the global 
spe-path is different from the local subgame perfect equilibrium path from a to 
b. 

For narrowing the gap between the positive result for cactus graphs (having 
treewidth 2) and the negative result of general (and bipartite) graphs, it would 
be interesting to consider graphs of bounded treewidth, as it was done for Ge¬ 
ography. However, a potential PSPACE-completeness result for Shortest 
Path Game on bounded treewidth graphs along the lines of the proof of The¬ 
orem [ 3 ] does not seem to be within reach. To be in line with that proof, we 
would need a restricted variant of Quantified 3-Sat satisfying the property 
of having respectful bounded treewidth (cf. Atserias and Oliva [l|) in order to 
achieve bounded treewidth of the graph used. However, Atserias and Oliva [l| 
note that bounded treewidth quantified boolean formula become polynomial 
time solvable when restricted to respectful bounded treewidth instances by a 
result of Chen and Dalmau Q. Thus, a potential PSPACE-completeness result 
for Shortest Path Game on bounded treewidth graphs will likely require a 
different approach; the computational complexity of Shortest Path Game on 
bounded treewidth graphs remains an interesting open question. 
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spe-path 



Figure 10: Example where the connection from a to 6 used in the global spe-path from s to t 
is disjoint form the local spe-path from a to b. Unlabeled edges have cost 0. 
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